Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 7535 |
| Missing cells | 24808 |
| Missing cells (%) | 12.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.6 MiB |
| Average record size in memory | 216.0 B |
Variable types
| Categorical | 11 |
|---|---|
| Numeric | 16 |
INSTNM has a high cardinality: 7535 distinct values | High cardinality |
CITY has a high cardinality: 2514 distinct values | High cardinality |
STABBR has a high cardinality: 59 distinct values | High cardinality |
MD_EARN_WNE_P10 has a high cardinality: 598 distinct values | High cardinality |
GRAD_DEBT_MDN_SUPP has a high cardinality: 2038 distinct values | High cardinality |
SATVRMID is highly correlated with SATMTMID | High correlation |
SATMTMID is highly correlated with SATVRMID | High correlation |
HBCU has 371 (4.9%) missing values | Missing |
MENONLY has 371 (4.9%) missing values | Missing |
WOMENONLY has 371 (4.9%) missing values | Missing |
SATVRMID has 6350 (84.3%) missing values | Missing |
SATMTMID has 6339 (84.1%) missing values | Missing |
DISTANCEONLY has 371 (4.9%) missing values | Missing |
UGDS has 661 (8.8%) missing values | Missing |
UGDS_WHITE has 661 (8.8%) missing values | Missing |
UGDS_BLACK has 661 (8.8%) missing values | Missing |
UGDS_HISP has 661 (8.8%) missing values | Missing |
UGDS_ASIAN has 661 (8.8%) missing values | Missing |
UGDS_AIAN has 661 (8.8%) missing values | Missing |
UGDS_NHPI has 661 (8.8%) missing values | Missing |
UGDS_2MOR has 661 (8.8%) missing values | Missing |
UGDS_NRA has 661 (8.8%) missing values | Missing |
UGDS_UNKN has 661 (8.8%) missing values | Missing |
PPTUG_EF has 682 (9.1%) missing values | Missing |
PCTPELL has 686 (9.1%) missing values | Missing |
PCTFLOAN has 686 (9.1%) missing values | Missing |
UG25ABV has 817 (10.8%) missing values | Missing |
MD_EARN_WNE_P10 has 1122 (14.9%) missing values | Missing |
UGDS_NHPI is highly skewed (γ1 = 22.78419429) | Skewed |
INSTNM is uniformly distributed | Uniform |
INSTNM has unique values | Unique |
UGDS_WHITE has 242 (3.2%) zeros | Zeros |
UGDS_BLACK has 562 (7.5%) zeros | Zeros |
UGDS_HISP has 588 (7.8%) zeros | Zeros |
UGDS_ASIAN has 1561 (20.7%) zeros | Zeros |
UGDS_AIAN has 2442 (32.4%) zeros | Zeros |
UGDS_NHPI has 3520 (46.7%) zeros | Zeros |
UGDS_2MOR has 2036 (27.0%) zeros | Zeros |
UGDS_NRA has 3906 (51.8%) zeros | Zeros |
UGDS_UNKN has 2067 (27.4%) zeros | Zeros |
PPTUG_EF has 1903 (25.3%) zeros | Zeros |
PCTFLOAN has 687 (9.1%) zeros | Zeros |
Reproduction
| Analysis started | 2021-04-20 19:39:12.550607 |
|---|---|
| Analysis finished | 2021-04-20 19:39:46.171164 |
| Duration | 33.62 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
| Distinct | 7535 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 KiB |
| Gemini School of Visual Arts & Communication | 1 |
|---|---|
| Hebrew Union College-Jewish Institute of Religion | 1 |
| Harrison College-Morrisville | 1 |
| Hypnosis Motivation Institute | 1 |
| Rasmussen College–Romeoville/Joliet | 1 |
| Other values (7530) |
Length
| Max length | 93 |
|---|---|
| Median length | 30 |
| Mean length | 30.53497014 |
| Min length | 3 |
Characters and Unicode
| Total characters | 230081 |
|---|---|
| Distinct characters | 74 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 7535 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Alabama A & M University |
|---|---|
| 2nd row | University of Alabama at Birmingham |
| 3rd row | Amridge University |
| 4th row | University of Alabama in Huntsville |
| 5th row | Alabama State University |
| Value | Count | Frequency (%) |
| Gemini School of Visual Arts & Communication | 1 | < 0.1% |
| Hebrew Union College-Jewish Institute of Religion | 1 | < 0.1% |
| Harrison College-Morrisville | 1 | < 0.1% |
| Hypnosis Motivation Institute | 1 | < 0.1% |
| Rasmussen College–Romeoville/Joliet | 1 | < 0.1% |
| Columbia College-Hollywood | 1 | < 0.1% |
| H Councill Trenholm State Community College | 1 | < 0.1% |
| Pivot Point Academy-Evanston | 1 | < 0.1% |
| Pellissippi State Community College | 1 | < 0.1% |
| Summit College | 1 | < 0.1% |
| Other values (7525) | 7525 |
| Value | Count | Frequency (%) |
| college | 2267 | 8.0% |
| of | 1919 | 6.8% |
| university | 1195 | 4.2% |
| school | 673 | 2.4% |
| beauty | 572 | 2.0% |
| institute | 553 | 2.0% |
| community | 546 | 1.9% |
| technical | 522 | 1.8% |
| state | 381 | 1.3% |
| academy | 320 | 1.1% |
| Other values (5774) | 19340 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 23636 | 10.3% |
| 20766 | 9.0% | |
| o | 16426 | 7.1% |
| i | 14711 | 6.4% |
| t | 14416 | 6.3% |
| l | 14342 | 6.2% |
| a | 14049 | 6.1% |
| n | 14009 | 6.1% |
| r | 10761 | 4.7% |
| s | 9419 | 4.1% |
| Other values (64) | 77546 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 176246 | |
| Uppercase Letter | 29630 | 12.9% |
| Space Separator | 20767 | 9.0% |
| Dash Punctuation | 3039 | 1.3% |
| Other Punctuation | 365 | 0.2% |
| Decimal Number | 31 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
| Final Punctuation | 1 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 23636 | |
| o | 16426 | |
| i | 14711 | 8.3% |
| t | 14416 | 8.2% |
| l | 14342 | 8.1% |
| a | 14049 | 8.0% |
| n | 14009 | 7.9% |
| r | 10761 | 6.1% |
| s | 9419 | 5.3% |
| c | 5917 | 3.4% |
| Other values (17) | 38560 |
| Value | Count | Frequency (%) |
| C | 6489 | |
| S | 3138 | |
| T | 2132 | 7.2% |
| U | 1894 | 6.4% |
| A | 1814 | 6.1% |
| B | 1773 | 6.0% |
| I | 1733 | 5.8% |
| M | 1678 | 5.7% |
| P | 1135 | 3.8% |
| H | 944 | 3.2% |
| Other values (16) | 6900 |
| Value | Count | Frequency (%) |
| 1 | 9 | |
| 2 | 9 | |
| 4 | 3 | 9.7% |
| 3 | 2 | 6.5% |
| 0 | 2 | 6.5% |
| 9 | 2 | 6.5% |
| 6 | 2 | 6.5% |
| 5 | 1 | 3.2% |
| 7 | 1 | 3.2% |
| Value | Count | Frequency (%) |
| ' | 171 | |
| & | 156 | |
| . | 20 | 5.5% |
| / | 13 | 3.6% |
| # | 5 | 1.4% |
| Value | Count | Frequency (%) |
| 20766 | ||
| 1 | < 0.1% |
| Value | Count | Frequency (%) |
| - | 3029 | |
| – | 10 | 0.3% |
| Value | Count | Frequency (%) |
| ( | 1 |
| Value | Count | Frequency (%) |
| ) | 1 |
| Value | Count | Frequency (%) |
| ’ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 205876 | |
| Common | 24205 | 10.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 23636 | 11.5% |
| o | 16426 | 8.0% |
| i | 14711 | 7.1% |
| t | 14416 | 7.0% |
| l | 14342 | 7.0% |
| a | 14049 | 6.8% |
| n | 14009 | 6.8% |
| r | 10761 | 5.2% |
| s | 9419 | 4.6% |
| C | 6489 | 3.2% |
| Other values (43) | 67618 |
| Value | Count | Frequency (%) |
| 20766 | ||
| - | 3029 | 12.5% |
| ' | 171 | 0.7% |
| & | 156 | 0.6% |
| . | 20 | 0.1% |
| / | 13 | 0.1% |
| – | 10 | < 0.1% |
| 1 | 9 | < 0.1% |
| 2 | 9 | < 0.1% |
| # | 5 | < 0.1% |
| Other values (11) | 17 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 230067 | |
| Punctuation | 11 | < 0.1% |
| None | 3 | < 0.1% |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 23636 | 10.3% |
| 20766 | 9.0% | |
| o | 16426 | 7.1% |
| i | 14711 | 6.4% |
| t | 14416 | 6.3% |
| l | 14342 | 6.2% |
| a | 14049 | 6.1% |
| n | 14009 | 6.1% |
| r | 10761 | 4.7% |
| s | 9419 | 4.1% |
| Other values (60) | 77532 |
| Value | Count | Frequency (%) |
| – | 10 | |
| ’ | 1 | 9.1% |
| Value | Count | Frequency (%) |
| í | 2 | |
| 1 |
| Distinct | 2514 |
|---|---|
| Distinct (%) | 33.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 KiB |
| New York | 87 |
|---|---|
| Chicago | 78 |
| Houston | 72 |
| Los Angeles | 56 |
| Miami | 51 |
| Other values (2509) |
Length
| Max length | 24 |
|---|---|
| Median length | 9 |
| Mean length | 8.819110816 |
| Min length | 3 |
Characters and Unicode
| Total characters | 66452 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1418 ? |
|---|---|
| Unique (%) | 18.8% |
Sample
| 1st row | Normal |
|---|---|
| 2nd row | Birmingham |
| 3rd row | Montgomery |
| 4th row | Huntsville |
| 5th row | Montgomery |
| Value | Count | Frequency (%) |
| New York | 87 | 1.2% |
| Chicago | 78 | 1.0% |
| Houston | 72 | 1.0% |
| Los Angeles | 56 | 0.7% |
| Miami | 51 | 0.7% |
| San Antonio | 49 | 0.7% |
| Dallas | 48 | 0.6% |
| Philadelphia | 46 | 0.6% |
| Brooklyn | 46 | 0.6% |
| Jacksonville | 41 | 0.5% |
| Other values (2504) | 6961 |
| Value | Count | Frequency (%) |
| city | 190 | 2.0% |
| san | 177 | 1.8% |
| new | 151 | 1.6% |
| york | 101 | 1.1% |
| chicago | 80 | 0.8% |
| park | 79 | 0.8% |
| fort | 77 | 0.8% |
| saint | 76 | 0.8% |
| houston | 73 | 0.8% |
| beach | 71 | 0.7% |
| Other values (2332) | 8497 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6126 | 9.2% |
| e | 5810 | 8.7% |
| o | 5146 | 7.7% |
| n | 5037 | 7.6% |
| l | 4318 | 6.5% |
| i | 4218 | 6.3% |
| r | 4000 | 6.0% |
| t | 3494 | 5.3% |
| s | 2951 | 4.4% |
| 2045 | 3.1% | |
| Other values (48) | 23307 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 54711 | |
| Uppercase Letter | 9657 | 14.5% |
| Space Separator | 2045 | 3.1% |
| Other Punctuation | 25 | < 0.1% |
| Dash Punctuation | 14 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| a | 6126 | |
| e | 5810 | |
| o | 5146 | |
| n | 5037 | |
| l | 4318 | 7.9% |
| i | 4218 | 7.7% |
| r | 4000 | 7.3% |
| t | 3494 | 6.4% |
| s | 2951 | 5.4% |
| u | 1625 | 3.0% |
| Other values (18) | 11986 |
| Value | Count | Frequency (%) |
| C | 1011 | 10.5% |
| S | 976 | 10.1% |
| B | 732 | 7.6% |
| M | 721 | 7.5% |
| P | 673 | 7.0% |
| L | 620 | 6.4% |
| A | 589 | 6.1% |
| H | 494 | 5.1% |
| W | 446 | 4.6% |
| R | 406 | 4.2% |
| Other values (16) | 2989 |
| Value | Count | Frequency (%) |
| . | 20 | |
| ' | 5 | 20.0% |
| Value | Count | Frequency (%) |
| 2045 |
| Value | Count | Frequency (%) |
| - | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 64368 | |
| Common | 2084 | 3.1% |
Most frequent character per script
| Value | Count | Frequency (%) |
| a | 6126 | 9.5% |
| e | 5810 | 9.0% |
| o | 5146 | 8.0% |
| n | 5037 | 7.8% |
| l | 4318 | 6.7% |
| i | 4218 | 6.6% |
| r | 4000 | 6.2% |
| t | 3494 | 5.4% |
| s | 2951 | 4.6% |
| u | 1625 | 2.5% |
| Other values (44) | 21643 |
| Value | Count | Frequency (%) |
| 2045 | ||
| . | 20 | 1.0% |
| - | 14 | 0.7% |
| ' | 5 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 66449 | |
| None | 3 | < 0.1% |
Most frequent character per block
| Value | Count | Frequency (%) |
| a | 6126 | 9.2% |
| e | 5810 | 8.7% |
| o | 5146 | 7.7% |
| n | 5037 | 7.6% |
| l | 4318 | 6.5% |
| i | 4218 | 6.3% |
| r | 4000 | 6.0% |
| t | 3494 | 5.3% |
| s | 2951 | 4.4% |
| 2045 | 3.1% | |
| Other values (46) | 23304 |
| Value | Count | Frequency (%) |
| ó | 2 | |
| í | 1 |
| Distinct | 59 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 KiB |
| CA | |
|---|---|
| TX | 472 |
| NY | 459 |
| FL | 436 |
| PA | 394 |
| Other values (54) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 15070 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | AL |
|---|---|
| 2nd row | AL |
| 3rd row | AL |
| 4th row | AL |
| 5th row | AL |
| Value | Count | Frequency (%) |
| CA | 773 | 10.3% |
| TX | 472 | 6.3% |
| NY | 459 | 6.1% |
| FL | 436 | 5.8% |
| PA | 394 | 5.2% |
| OH | 352 | 4.7% |
| IL | 300 | 4.0% |
| MI | 207 | 2.7% |
| NC | 204 | 2.7% |
| MA | 194 | 2.6% |
| Other values (49) | 3744 |
| Value | Count | Frequency (%) |
| ca | 773 | 10.3% |
| tx | 472 | 6.3% |
| ny | 459 | 6.1% |
| fl | 436 | 5.8% |
| pa | 394 | 5.2% |
| oh | 352 | 4.7% |
| il | 300 | 4.0% |
| mi | 207 | 2.7% |
| nc | 204 | 2.7% |
| ma | 194 | 2.6% |
| Other values (49) | 3744 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 2392 | |
| N | 1541 | 10.2% |
| C | 1341 | 8.9% |
| M | 1039 | 6.9% |
| I | 964 | 6.4% |
| L | 953 | 6.3% |
| O | 909 | 6.0% |
| T | 892 | 5.9% |
| Y | 576 | 3.8% |
| P | 544 | 3.6% |
| Other values (14) | 3919 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15070 |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 2392 | |
| N | 1541 | 10.2% |
| C | 1341 | 8.9% |
| M | 1039 | 6.9% |
| I | 964 | 6.4% |
| L | 953 | 6.3% |
| O | 909 | 6.0% |
| T | 892 | 5.9% |
| Y | 576 | 3.8% |
| P | 544 | 3.6% |
| Other values (14) | 3919 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15070 |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 2392 | |
| N | 1541 | 10.2% |
| C | 1341 | 8.9% |
| M | 1039 | 6.9% |
| I | 964 | 6.4% |
| L | 953 | 6.3% |
| O | 909 | 6.0% |
| T | 892 | 5.9% |
| Y | 576 | 3.8% |
| P | 544 | 3.6% |
| Other values (14) | 3919 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15070 |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 2392 | |
| N | 1541 | 10.2% |
| C | 1341 | 8.9% |
| M | 1039 | 6.9% |
| I | 964 | 6.4% |
| L | 953 | 6.3% |
| O | 909 | 6.0% |
| T | 892 | 5.9% |
| Y | 576 | 3.8% |
| P | 544 | 3.6% |
| Other values (14) | 3919 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 371 |
| Missing (%) | 4.9% |
| Memory size | 59.0 KiB |
| 0.0 | |
|---|---|
| 1.0 | 102 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 21492 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 1.0 |
| Value | Count | Frequency (%) |
| 0.0 | 7062 | |
| 1.0 | 102 | 1.4% |
| (Missing) | 371 | 4.9% |
| Value | Count | Frequency (%) |
| 0.0 | 7062 | |
| 1.0 | 102 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 14226 | |
| . | 7164 | |
| 1 | 102 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14328 | |
| Other Punctuation | 7164 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 14226 | |
| 1 | 102 | 0.7% |
| Value | Count | Frequency (%) |
| . | 7164 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21492 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 14226 | |
| . | 7164 | |
| 1 | 102 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21492 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 14226 | |
| . | 7164 | |
| 1 | 102 | 0.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 371 |
| Missing (%) | 4.9% |
| Memory size | 59.0 KiB |
| 0.0 | |
|---|---|
| 1.0 | 66 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 21492 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 7098 | |
| 1.0 | 66 | 0.9% |
| (Missing) | 371 | 4.9% |
| Value | Count | Frequency (%) |
| 0.0 | 7098 | |
| 1.0 | 66 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 14262 | |
| . | 7164 | |
| 1 | 66 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14328 | |
| Other Punctuation | 7164 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 14262 | |
| 1 | 66 | 0.5% |
| Value | Count | Frequency (%) |
| . | 7164 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21492 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 14262 | |
| . | 7164 | |
| 1 | 66 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21492 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 14262 | |
| . | 7164 | |
| 1 | 66 | 0.3% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 371 |
| Missing (%) | 4.9% |
| Memory size | 59.0 KiB |
| 0.0 | |
|---|---|
| 1.0 | 38 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 21492 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 7126 | |
| 1.0 | 38 | 0.5% |
| (Missing) | 371 | 4.9% |
| Value | Count | Frequency (%) |
| 0.0 | 7126 | |
| 1.0 | 38 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 14290 | |
| . | 7164 | |
| 1 | 38 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14328 | |
| Other Punctuation | 7164 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 14290 | |
| 1 | 38 | 0.3% |
| Value | Count | Frequency (%) |
| . | 7164 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21492 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 14290 | |
| . | 7164 | |
| 1 | 38 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21492 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 14290 | |
| . | 7164 | |
| 1 | 38 | 0.2% |
RELAFFIL
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7535 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 6096 | |
| 1 | 1439 | 19.1% |
| Value | Count | Frequency (%) |
| 0 | 6096 | |
| 1 | 1439 | 19.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6096 | |
| 1 | 1439 | 19.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7535 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 6096 | |
| 1 | 1439 | 19.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7535 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 6096 | |
| 1 | 1439 | 19.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7535 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 6096 | |
| 1 | 1439 | 19.1% |
| Distinct | 163 |
|---|---|
| Distinct (%) | 13.8% |
| Missing | 6350 |
| Missing (%) | 84.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 522.8194093 |
|---|---|
| Minimum | 290 |
| Maximum | 765 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 290 |
|---|---|
| 5-th percentile | 430 |
| Q1 | 475 |
| median | 510 |
| Q3 | 555 |
| 95-th percentile | 665 |
| Maximum | 765 |
| Range | 475 |
| Interquartile range (IQR) | 80 |
Descriptive statistics
| Standard deviation | 68.57886165 |
|---|---|
| Coefficient of variation (CV) | 0.1311712236 |
| Kurtosis | 1.074681792 |
| Mean | 522.8194093 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | 0.8192137191 |
| Sum | 619541 |
| Variance | 4703.060265 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 495 | 52 | 0.7% |
| 475 | 43 | 0.6% |
| 500 | 39 | 0.5% |
| 470 | 37 | 0.5% |
| 490 | 36 | 0.5% |
| 520 | 34 | 0.5% |
| 530 | 34 | 0.5% |
| 505 | 34 | 0.5% |
| 510 | 32 | 0.4% |
| 480 | 31 | 0.4% |
| Other values (153) | 813 | 10.8% |
| (Missing) | 6350 |
| Value | Count | Frequency (%) |
| 290 | 1 | |
| 360 | 2 | |
| 363 | 1 | |
| 365 | 1 | |
| 380 | 2 |
| Value | Count | Frequency (%) |
| 765 | 1 | |
| 760 | 1 | |
| 755 | 1 | |
| 750 | 1 | |
| 745 | 2 |
| Distinct | 167 |
|---|---|
| Distinct (%) | 14.0% |
| Missing | 6339 |
| Missing (%) | 84.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 530.7650502 |
|---|---|
| Minimum | 310 |
| Maximum | 785 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 310 |
|---|---|
| 5-th percentile | 430 |
| Q1 | 482 |
| median | 520 |
| Q3 | 565 |
| 95-th percentile | 685 |
| Maximum | 785 |
| Range | 475 |
| Interquartile range (IQR) | 83 |
Descriptive statistics
| Standard deviation | 73.4697671 |
|---|---|
| Coefficient of variation (CV) | 0.1384223906 |
| Kurtosis | 0.8642076997 |
| Mean | 530.7650502 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | 0.8598872434 |
| Sum | 634795 |
| Variance | 5397.806677 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 490 | 47 | 0.6% |
| 520 | 43 | 0.6% |
| 500 | 42 | 0.6% |
| 480 | 38 | 0.5% |
| 485 | 36 | 0.5% |
| 510 | 35 | 0.5% |
| 505 | 35 | 0.5% |
| 525 | 34 | 0.5% |
| 495 | 33 | 0.4% |
| 515 | 33 | 0.4% |
| Other values (157) | 820 | 10.9% |
| (Missing) | 6339 |
| Value | Count | Frequency (%) |
| 310 | 1 | |
| 360 | 1 | |
| 365 | 1 | |
| 368 | 1 | |
| 375 | 2 |
| Value | Count | Frequency (%) |
| 785 | 1 | < 0.1% |
| 770 | 2 | |
| 760 | 2 | |
| 758 | 1 | < 0.1% |
| 755 | 3 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 371 |
| Missing (%) | 4.9% |
| Memory size | 59.0 KiB |
| 0.0 | |
|---|---|
| 1.0 | 40 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 21492 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 7124 | |
| 1.0 | 40 | 0.5% |
| (Missing) | 371 | 4.9% |
| Value | Count | Frequency (%) |
| 0.0 | 7124 | |
| 1.0 | 40 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 14288 | |
| . | 7164 | |
| 1 | 40 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14328 | |
| Other Punctuation | 7164 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 14288 | |
| 1 | 40 | 0.3% |
| Value | Count | Frequency (%) |
| . | 7164 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21492 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 14288 | |
| . | 7164 | |
| 1 | 40 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21492 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 14288 | |
| . | 7164 | |
| 1 | 40 | 0.2% |
| Distinct | 2932 |
|---|---|
| Distinct (%) | 42.7% |
| Missing | 661 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2356.83794 |
|---|---|
| Minimum | 0 |
| Maximum | 151558 |
| Zeros | 7 |
| Zeros (%) | 0.1% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 31.65 |
| Q1 | 117 |
| median | 412.5 |
| Q3 | 1929.5 |
| 95-th percentile | 11858.05 |
| Maximum | 151558 |
| Range | 151558 |
| Interquartile range (IQR) | 1812.5 |
Descriptive statistics
| Standard deviation | 5474.275871 |
|---|---|
| Coefficient of variation (CV) | 2.322720531 |
| Kurtosis | 103.7582136 |
| Mean | 2356.83794 |
| Median Absolute Deviation (MAD) | 361.5 |
| Skewness | 6.806530158 |
| Sum | 16200904 |
| Variance | 29967696.31 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 39 | 29 | 0.4% |
| 61 | 26 | 0.3% |
| 46 | 26 | 0.3% |
| 58 | 26 | 0.3% |
| 38 | 25 | 0.3% |
| 60 | 25 | 0.3% |
| 47 | 25 | 0.3% |
| 95 | 24 | 0.3% |
| 43 | 24 | 0.3% |
| 63 | 24 | 0.3% |
| Other values (2922) | 6620 | |
| (Missing) | 661 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 7 | |
| 1 | 3 | |
| 2 | 3 | |
| 3 | 3 | |
| 4 | 4 |
| Value | Count | Frequency (%) |
| 151558 | 1 | |
| 77657 | 1 | |
| 61470 | 1 | |
| 59920 | 1 | |
| 58084 | 1 |
| Distinct | 4397 |
|---|---|
| Distinct (%) | 64.0% |
| Missing | 661 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.510207201 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 242 |
| Zeros (%) | 3.2% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.013265 |
| Q1 | 0.2675 |
| median | 0.5557 |
| Q3 | 0.747875 |
| 95-th percentile | 0.927315 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.480375 |
Descriptive statistics
| Standard deviation | 0.286958349 |
|---|---|
| Coefficient of variation (CV) | 0.5624349254 |
| Kurtosis | -1.079241841 |
| Mean | 0.510207201 |
| Median Absolute Deviation (MAD) | 0.227 |
| Skewness | -0.2560443813 |
| Sum | 3507.1643 |
| Variance | 0.08234509408 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 242 | 3.2% |
| 1 | 109 | 1.4% |
| 0.6667 | 22 | 0.3% |
| 0.5 | 18 | 0.2% |
| 0.8 | 15 | 0.2% |
| 0.6 | 13 | 0.2% |
| 0.8333 | 13 | 0.2% |
| 0.3333 | 13 | 0.2% |
| 0.5714 | 12 | 0.2% |
| 0.8571 | 11 | 0.1% |
| Other values (4387) | 6406 | |
| (Missing) | 661 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 242 | |
| 0.0004 | 2 | < 0.1% |
| 0.0005 | 2 | < 0.1% |
| 0.0007 | 1 | < 0.1% |
| 0.001 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 109 | |
| 0.9968 | 1 | < 0.1% |
| 0.9964 | 1 | < 0.1% |
| 0.995 | 1 | < 0.1% |
| 0.9948 | 1 | < 0.1% |
| Distinct | 3242 |
|---|---|
| Distinct (%) | 47.2% |
| Missing | 661 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1899966395 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 562 |
| Zeros (%) | 7.5% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.036125 |
| median | 0.10005 |
| Q3 | 0.2577 |
| 95-th percentile | 0.726715 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.221575 |
Descriptive statistics
| Standard deviation | 0.224586518 |
|---|---|
| Coefficient of variation (CV) | 1.182055212 |
| Kurtosis | 2.485739193 |
| Mean | 0.1899966395 |
| Median Absolute Deviation (MAD) | 0.08145 |
| Skewness | 1.733201177 |
| Sum | 1306.0369 |
| Variance | 0.05043910407 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 562 | 7.5% |
| 1 | 28 | 0.4% |
| 0.0476 | 16 | 0.2% |
| 0.1429 | 15 | 0.2% |
| 0.0345 | 14 | 0.2% |
| 0.1111 | 14 | 0.2% |
| 0.1875 | 13 | 0.2% |
| 0.1667 | 13 | 0.2% |
| 0.2 | 13 | 0.2% |
| 0.0588 | 13 | 0.2% |
| Other values (3232) | 6173 | |
| (Missing) | 661 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 562 | |
| 0.0002 | 1 | < 0.1% |
| 0.0003 | 1 | < 0.1% |
| 0.0005 | 1 | < 0.1% |
| 0.0006 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 28 | |
| 0.994 | 1 | < 0.1% |
| 0.9911 | 1 | < 0.1% |
| 0.9908 | 1 | < 0.1% |
| 0.9899 | 1 | < 0.1% |
| Distinct | 2809 |
|---|---|
| Distinct (%) | 40.9% |
| Missing | 661 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1616348851 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 588 |
| Zeros (%) | 7.8% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.0276 |
| median | 0.0714 |
| Q3 | 0.198875 |
| 95-th percentile | 0.6667 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.171275 |
Descriptive statistics
| Standard deviation | 0.2218537907 |
|---|---|
| Coefficient of variation (CV) | 1.372561317 |
| Kurtosis | 4.88840692 |
| Mean | 0.1616348851 |
| Median Absolute Deviation (MAD) | 0.056 |
| Skewness | 2.251145805 |
| Sum | 1111.0782 |
| Variance | 0.04921910443 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 588 | 7.8% |
| 1 | 136 | 1.8% |
| 0.0435 | 16 | 0.2% |
| 0.1111 | 16 | 0.2% |
| 0.0556 | 16 | 0.2% |
| 0.1 | 15 | 0.2% |
| 0.2 | 15 | 0.2% |
| 0.0359 | 15 | 0.2% |
| 0.0417 | 15 | 0.2% |
| 0.0952 | 15 | 0.2% |
| Other values (2799) | 6027 | |
| (Missing) | 661 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 588 | |
| 0.0007 | 3 | < 0.1% |
| 0.001 | 1 | < 0.1% |
| 0.0012 | 2 | < 0.1% |
| 0.0014 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 136 | |
| 0.9991 | 1 | < 0.1% |
| 0.9981 | 1 | < 0.1% |
| 0.9978 | 1 | < 0.1% |
| 0.9974 | 3 | < 0.1% |
| Distinct | 1254 |
|---|---|
| Distinct (%) | 18.2% |
| Missing | 661 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.03354423916 |
|---|---|
| Minimum | 0 |
| Maximum | 0.9727 |
| Zeros | 1561 |
| Zeros (%) | 20.7% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.0025 |
| median | 0.0129 |
| Q3 | 0.0327 |
| 95-th percentile | 0.131775 |
| Maximum | 0.9727 |
| Range | 0.9727 |
| Interquartile range (IQR) | 0.0302 |
Descriptive statistics
| Standard deviation | 0.07377719782 |
|---|---|
| Coefficient of variation (CV) | 2.199399947 |
| Kurtosis | 58.81975651 |
| Mean | 0.03354423916 |
| Median Absolute Deviation (MAD) | 0.0129 |
| Skewness | 6.489844249 |
| Sum | 230.5831 |
| Variance | 0.005443074919 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1561 | 20.7% |
| 0.0074 | 30 | 0.4% |
| 0.0086 | 27 | 0.4% |
| 0.0112 | 26 | 0.3% |
| 0.007 | 25 | 0.3% |
| 0.009 | 25 | 0.3% |
| 0.0064 | 24 | 0.3% |
| 0.0154 | 24 | 0.3% |
| 0.0054 | 24 | 0.3% |
| 0.0095 | 24 | 0.3% |
| Other values (1244) | 5084 | |
| (Missing) | 661 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 1561 | |
| 0.0002 | 3 | < 0.1% |
| 0.0003 | 1 | < 0.1% |
| 0.0004 | 2 | < 0.1% |
| 0.0005 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.9727 | 1 | |
| 0.967 | 1 | |
| 0.9658 | 1 | |
| 0.9595 | 1 | |
| 0.9524 | 1 |
| Distinct | 601 |
|---|---|
| Distinct (%) | 8.7% |
| Missing | 661 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0138125691 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 2442 |
| Zeros (%) | 32.4% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.0026 |
| Q3 | 0.0073 |
| 95-th percentile | 0.03641 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.0073 |
Descriptive statistics
| Standard deviation | 0.07019573509 |
|---|---|
| Coefficient of variation (CV) | 5.082018745 |
| Kurtosis | 134.7402217 |
| Mean | 0.0138125691 |
| Median Absolute Deviation (MAD) | 0.0026 |
| Skewness | 11.08329272 |
| Sum | 94.9476 |
| Variance | 0.004927441224 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2442 | |
| 0.002 | 68 | 0.9% |
| 0.0022 | 64 | 0.8% |
| 0.0024 | 63 | 0.8% |
| 0.0015 | 60 | 0.8% |
| 0.0023 | 58 | 0.8% |
| 0.0036 | 58 | 0.8% |
| 0.0018 | 57 | 0.8% |
| 0.0043 | 55 | 0.7% |
| 0.0025 | 54 | 0.7% |
| Other values (591) | 3895 | |
| (Missing) | 661 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 2442 | |
| 0.0003 | 5 | 0.1% |
| 0.0004 | 21 | 0.3% |
| 0.0005 | 22 | 0.3% |
| 0.0006 | 20 | 0.3% |
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0.9944 | 1 | |
| 0.9894 | 1 | |
| 0.9887 | 1 | |
| 0.9821 | 1 |
| Distinct | 363 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 661 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.004568897294 |
|---|---|
| Minimum | 0 |
| Maximum | 0.9983 |
| Zeros | 3520 |
| Zeros (%) | 46.7% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0.0025 |
| 95-th percentile | 0.0152 |
| Maximum | 0.9983 |
| Range | 0.9983 |
| Interquartile range (IQR) | 0.0025 |
Descriptive statistics
| Standard deviation | 0.0331250764 |
|---|---|
| Coefficient of variation (CV) | 7.25012498 |
| Kurtosis | 600.1361399 |
| Mean | 0.004568897294 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 22.78419429 |
| Sum | 31.4066 |
| Variance | 0.001097270687 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3520 | |
| 0.0007 | 105 | 1.4% |
| 0.0009 | 98 | 1.3% |
| 0.0011 | 95 | 1.3% |
| 0.0008 | 93 | 1.2% |
| 0.001 | 93 | 1.2% |
| 0.0006 | 92 | 1.2% |
| 0.0014 | 85 | 1.1% |
| 0.0018 | 83 | 1.1% |
| 0.0004 | 78 | 1.0% |
| Other values (353) | 2532 | |
| (Missing) | 661 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 3520 | |
| 0.0001 | 9 | 0.1% |
| 0.0002 | 31 | 0.4% |
| 0.0003 | 37 | 0.5% |
| 0.0004 | 78 | 1.0% |
| Value | Count | Frequency (%) |
| 0.9983 | 1 | |
| 0.9917 | 1 | |
| 0.9881 | 1 | |
| 0.9538 | 1 | |
| 0.9193 | 1 |
| Distinct | 957 |
|---|---|
| Distinct (%) | 13.9% |
| Missing | 661 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0239503055 |
|---|---|
| Minimum | 0 |
| Maximum | 0.5333 |
| Zeros | 2036 |
| Zeros (%) | 27.0% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.0175 |
| Q3 | 0.0339 |
| 95-th percentile | 0.0769 |
| Maximum | 0.5333 |
| Range | 0.5333 |
| Interquartile range (IQR) | 0.0339 |
Descriptive statistics
| Standard deviation | 0.03128804781 |
|---|---|
| Coefficient of variation (CV) | 1.306373641 |
| Kurtosis | 36.19576555 |
| Mean | 0.0239503055 |
| Median Absolute Deviation (MAD) | 0.0175 |
| Skewness | 4.127378462 |
| Sum | 164.6344 |
| Variance | 0.0009789419357 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2036 | |
| 0.0217 | 22 | 0.3% |
| 0.0294 | 21 | 0.3% |
| 0.0208 | 20 | 0.3% |
| 0.0213 | 19 | 0.3% |
| 0.0172 | 19 | 0.3% |
| 0.0303 | 19 | 0.3% |
| 0.0167 | 19 | 0.3% |
| 0.019 | 18 | 0.2% |
| 0.0238 | 18 | 0.2% |
| Other values (947) | 4663 | |
| (Missing) | 661 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 2036 | |
| 0.0001 | 1 | < 0.1% |
| 0.0002 | 4 | 0.1% |
| 0.0003 | 6 | 0.1% |
| 0.0004 | 6 | 0.1% |
| Value | Count | Frequency (%) |
| 0.5333 | 1 | |
| 0.4369 | 1 | |
| 0.4265 | 1 | |
| 0.3897 | 1 | |
| 0.3459 | 1 |
| Distinct | 920 |
|---|---|
| Distinct (%) | 13.4% |
| Missing | 661 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.01608581612 |
|---|---|
| Minimum | 0 |
| Maximum | 0.9286 |
| Zeros | 3906 |
| Zeros (%) | 51.8% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0.0117 |
| 95-th percentile | 0.078135 |
| Maximum | 0.9286 |
| Range | 0.9286 |
| Interquartile range (IQR) | 0.0117 |
Descriptive statistics
| Standard deviation | 0.05017188539 |
|---|---|
| Coefficient of variation (CV) | 3.119013982 |
| Kurtosis | 101.2317396 |
| Mean | 0.01608581612 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.284843909 |
| Sum | 110.5739 |
| Variance | 0.002517218083 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3906 | |
| 0.0018 | 27 | 0.4% |
| 0.0015 | 24 | 0.3% |
| 0.0009 | 24 | 0.3% |
| 0.0007 | 23 | 0.3% |
| 0.002 | 22 | 0.3% |
| 0.0029 | 22 | 0.3% |
| 0.003 | 21 | 0.3% |
| 0.0012 | 21 | 0.3% |
| 0.0013 | 20 | 0.3% |
| Other values (910) | 2764 | |
| (Missing) | 661 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 3906 | |
| 0.0001 | 7 | 0.1% |
| 0.0002 | 8 | 0.1% |
| 0.0003 | 11 | 0.1% |
| 0.0004 | 13 | 0.2% |
| Value | Count | Frequency (%) |
| 0.9286 | 1 | |
| 0.8942 | 1 | |
| 0.8718 | 1 | |
| 0.8 | 1 | |
| 0.7944 | 1 |
| Distinct | 1517 |
|---|---|
| Distinct (%) | 22.1% |
| Missing | 661 |
| Missing (%) | 8.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0451814373 |
|---|---|
| Minimum | 0 |
| Maximum | 0.9027 |
| Zeros | 2067 |
| Zeros (%) | 27.4% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.0143 |
| Q3 | 0.0454 |
| 95-th percentile | 0.200575 |
| Maximum | 0.9027 |
| Range | 0.9027 |
| Interquartile range (IQR) | 0.0454 |
Descriptive statistics
| Standard deviation | 0.0934404091 |
|---|---|
| Coefficient of variation (CV) | 2.06811502 |
| Kurtosis | 23.72572674 |
| Mean | 0.0451814373 |
| Median Absolute Deviation (MAD) | 0.0143 |
| Skewness | 4.353223011 |
| Sum | 310.5772 |
| Variance | 0.008731110053 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2067 | |
| 0.0085 | 21 | 0.3% |
| 0.0068 | 20 | 0.3% |
| 0.0182 | 19 | 0.3% |
| 0.002 | 18 | 0.2% |
| 0.0098 | 17 | 0.2% |
| 0.0056 | 16 | 0.2% |
| 0.0045 | 15 | 0.2% |
| 0.0175 | 15 | 0.2% |
| 0.0094 | 15 | 0.2% |
| Other values (1507) | 4651 | |
| (Missing) | 661 | 8.8% |
| Value | Count | Frequency (%) |
| 0 | 2067 | |
| 0.0001 | 1 | < 0.1% |
| 0.0002 | 4 | 0.1% |
| 0.0003 | 4 | 0.1% |
| 0.0004 | 6 | 0.1% |
| Value | Count | Frequency (%) |
| 0.9027 | 1 | |
| 0.8785 | 1 | |
| 0.8745 | 1 | |
| 0.8668 | 1 | |
| 0.8625 | 1 |
| Distinct | 3420 |
|---|---|
| Distinct (%) | 49.9% |
| Missing | 682 |
| Missing (%) | 9.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2266389902 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 1903 |
| Zeros (%) | 25.3% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.1504 |
| Q3 | 0.3769 |
| 95-th percentile | 0.71062 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.3769 |
Descriptive statistics
| Standard deviation | 0.2464701974 |
|---|---|
| Coefficient of variation (CV) | 1.087501304 |
| Kurtosis | 0.216493156 |
| Mean | 0.2266389902 |
| Median Absolute Deviation (MAD) | 0.1504 |
| Skewness | 1.019221488 |
| Sum | 1553.157 |
| Variance | 0.06074755822 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1903 | 25.3% |
| 1 | 44 | 0.6% |
| 0.3333 | 15 | 0.2% |
| 0.25 | 12 | 0.2% |
| 0.5 | 11 | 0.1% |
| 0.2 | 10 | 0.1% |
| 0.1429 | 8 | 0.1% |
| 0.2857 | 8 | 0.1% |
| 0.2308 | 7 | 0.1% |
| 0.4 | 7 | 0.1% |
| Other values (3410) | 4828 | |
| (Missing) | 682 | 9.1% |
| Value | Count | Frequency (%) |
| 0 | 1903 | |
| 0.0002 | 1 | < 0.1% |
| 0.0003 | 1 | < 0.1% |
| 0.0004 | 1 | < 0.1% |
| 0.0007 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 44 | |
| 0.9959 | 1 | < 0.1% |
| 0.995 | 1 | < 0.1% |
| 0.9942 | 1 | < 0.1% |
| 0.9936 | 1 | < 0.1% |
CURROPER
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 KiB |
| 1 | |
|---|---|
| 0 | 578 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7535 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 6957 | |
| 0 | 578 | 7.7% |
| Value | Count | Frequency (%) |
| 1 | 6957 | |
| 0 | 578 | 7.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 6957 | |
| 0 | 578 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7535 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 6957 | |
| 0 | 578 | 7.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7535 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 6957 | |
| 0 | 578 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7535 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 6957 | |
| 0 | 578 | 7.7% |
| Distinct | 4422 |
|---|---|
| Distinct (%) | 64.6% |
| Missing | 686 |
| Missing (%) | 9.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5306430574 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 47 |
| Zeros (%) | 0.6% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.17 |
| Q1 | 0.3578 |
| median | 0.5215 |
| Q3 | 0.7129 |
| 95-th percentile | 0.89636 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.3551 |
Descriptive statistics
| Standard deviation | 0.2255443558 |
|---|---|
| Coefficient of variation (CV) | 0.425039681 |
| Kurtosis | -0.7846975123 |
| Mean | 0.5306430574 |
| Median Absolute Deviation (MAD) | 0.1763 |
| Skewness | 2.474414699 × 105 |
| Sum | 3634.3743 |
| Variance | 0.05087025644 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 66 | 0.9% |
| 0 | 47 | 0.6% |
| 0.5 | 30 | 0.4% |
| 0.8 | 14 | 0.2% |
| 0.75 | 13 | 0.2% |
| 0.4 | 12 | 0.2% |
| 0.6667 | 11 | 0.1% |
| 0.625 | 11 | 0.1% |
| 0.7143 | 10 | 0.1% |
| 0.8333 | 9 | 0.1% |
| Other values (4412) | 6626 | |
| (Missing) | 686 | 9.1% |
| Value | Count | Frequency (%) |
| 0 | 47 | |
| 0.0041 | 1 | < 0.1% |
| 0.0124 | 1 | < 0.1% |
| 0.0179 | 1 | < 0.1% |
| 0.0194 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 66 | |
| 0.9985 | 1 | < 0.1% |
| 0.9964 | 1 | < 0.1% |
| 0.9941 | 1 | < 0.1% |
| 0.9937 | 1 | < 0.1% |
| Distinct | 4155 |
|---|---|
| Distinct (%) | 60.7% |
| Missing | 686 |
| Missing (%) | 9.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5222108629 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 687 |
| Zeros (%) | 9.1% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.3329 |
| median | 0.5833 |
| Q3 | 0.745 |
| 95-th percentile | 0.89792 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.4121 |
Descriptive statistics
| Standard deviation | 0.2836155209 |
|---|---|
| Coefficient of variation (CV) | 0.5431053642 |
| Kurtosis | -0.8120751355 |
| Mean | 0.5222108629 |
| Median Absolute Deviation (MAD) | 0.1902 |
| Skewness | -0.5229443778 |
| Sum | 3576.6222 |
| Variance | 0.08043776368 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 687 | 9.1% |
| 1 | 55 | 0.7% |
| 0.5 | 17 | 0.2% |
| 0.75 | 15 | 0.2% |
| 0.8 | 14 | 0.2% |
| 0.8333 | 11 | 0.1% |
| 0.6 | 10 | 0.1% |
| 0.6667 | 10 | 0.1% |
| 0.5556 | 8 | 0.1% |
| 0.8182 | 7 | 0.1% |
| Other values (4145) | 6015 | |
| (Missing) | 686 | 9.1% |
| Value | Count | Frequency (%) |
| 0 | 687 | |
| 0.0006 | 1 | < 0.1% |
| 0.0011 | 1 | < 0.1% |
| 0.0018 | 1 | < 0.1% |
| 0.0024 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 55 | |
| 0.9976 | 1 | < 0.1% |
| 0.9953 | 1 | < 0.1% |
| 0.9944 | 1 | < 0.1% |
| 0.9941 | 1 | < 0.1% |
| Distinct | 4285 |
|---|---|
| Distinct (%) | 63.8% |
| Missing | 817 |
| Missing (%) | 10.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4100211968 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 51 |
| Zeros (%) | 0.7% |
| Memory size | 59.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.0374 |
| Q1 | 0.2415 |
| median | 0.40075 |
| Q3 | 0.572275 |
| 95-th percentile | 0.8 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.330775 |
Descriptive statistics
| Standard deviation | 0.2289391657 |
|---|---|
| Coefficient of variation (CV) | 0.5583593421 |
| Kurtosis | -0.672285885 |
| Mean | 0.4100211968 |
| Median Absolute Deviation (MAD) | 0.16515 |
| Skewness | 0.1626538357 |
| Sum | 2754.5224 |
| Variance | 0.05241314159 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 51 | 0.7% |
| 0.5 | 29 | 0.4% |
| 0.3333 | 22 | 0.3% |
| 0.6667 | 19 | 0.3% |
| 0.4 | 15 | 0.2% |
| 0.4286 | 15 | 0.2% |
| 0.4444 | 13 | 0.2% |
| 0.25 | 12 | 0.2% |
| 1 | 12 | 0.2% |
| 0.5556 | 12 | 0.2% |
| Other values (4275) | 6518 | |
| (Missing) | 817 | 10.8% |
| Value | Count | Frequency (%) |
| 0 | 51 | |
| 0.0005 | 1 | < 0.1% |
| 0.0006 | 1 | < 0.1% |
| 0.0007 | 1 | < 0.1% |
| 0.0008 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 12 | |
| 0.9896 | 1 | < 0.1% |
| 0.988 | 1 | < 0.1% |
| 0.9844 | 1 | < 0.1% |
| 0.9828 | 1 | < 0.1% |
| Distinct | 598 |
|---|---|
| Distinct (%) | 9.3% |
| Missing | 1122 |
| Missing (%) | 14.9% |
| Memory size | 59.0 KiB |
| PrivacySuppressed | |
|---|---|
| 38800 | 151 |
| 21500 | 97 |
| 49200 | 78 |
| 27400 | 46 |
| Other values (593) |
Length
| Max length | 17 |
|---|---|
| Median length | 5 |
| Mean length | 6.541712147 |
| Min length | 4 |
Characters and Unicode
| Total characters | 41952 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 132 ? |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | 30300 |
|---|---|
| 2nd row | 39700 |
| 3rd row | 40100 |
| 4th row | 45500 |
| 5th row | 26600 |
| Value | Count | Frequency (%) |
| PrivacySuppressed | 822 | 10.9% |
| 38800 | 151 | 2.0% |
| 21500 | 97 | 1.3% |
| 49200 | 78 | 1.0% |
| 27400 | 46 | 0.6% |
| 37000 | 45 | 0.6% |
| 28800 | 42 | 0.6% |
| 29700 | 42 | 0.6% |
| 25300 | 38 | 0.5% |
| 22400 | 34 | 0.5% |
| Other values (588) | 5018 | |
| (Missing) | 1122 | 14.9% |
| Value | Count | Frequency (%) |
| privacysuppressed | 822 | 12.8% |
| 38800 | 151 | 2.4% |
| 21500 | 97 | 1.5% |
| 49200 | 78 | 1.2% |
| 27400 | 46 | 0.7% |
| 37000 | 45 | 0.7% |
| 28800 | 42 | 0.7% |
| 29700 | 42 | 0.7% |
| 25300 | 38 | 0.6% |
| 29400 | 34 | 0.5% |
| Other values (588) | 5018 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 12228 | |
| 2 | 3012 | 7.2% |
| 3 | 2673 | 6.4% |
| 4 | 1934 | 4.6% |
| 1 | 1868 | 4.5% |
| r | 1644 | 3.9% |
| p | 1644 | 3.9% |
| e | 1644 | 3.9% |
| s | 1644 | 3.9% |
| 5 | 1442 | 3.4% |
| Other values (13) | 12219 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 27978 | |
| Lowercase Letter | 12330 | |
| Uppercase Letter | 1644 | 3.9% |
Most frequent character per category
| Value | Count | Frequency (%) |
| r | 1644 | |
| p | 1644 | |
| e | 1644 | |
| s | 1644 | |
| i | 822 | |
| v | 822 | |
| a | 822 | |
| c | 822 | |
| y | 822 | |
| u | 822 |
| Value | Count | Frequency (%) |
| 0 | 12228 | |
| 2 | 3012 | 10.8% |
| 3 | 2673 | 9.6% |
| 4 | 1934 | 6.9% |
| 1 | 1868 | 6.7% |
| 5 | 1442 | 5.2% |
| 8 | 1331 | 4.8% |
| 9 | 1171 | 4.2% |
| 6 | 1165 | 4.2% |
| 7 | 1154 | 4.1% |
| Value | Count | Frequency (%) |
| P | 822 | |
| S | 822 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 27978 | |
| Latin | 13974 |
Most frequent character per script
| Value | Count | Frequency (%) |
| r | 1644 | |
| p | 1644 | |
| e | 1644 | |
| s | 1644 | |
| P | 822 | 5.9% |
| i | 822 | 5.9% |
| v | 822 | 5.9% |
| a | 822 | 5.9% |
| c | 822 | 5.9% |
| y | 822 | 5.9% |
| Other values (3) | 2466 |
| Value | Count | Frequency (%) |
| 0 | 12228 | |
| 2 | 3012 | 10.8% |
| 3 | 2673 | 9.6% |
| 4 | 1934 | 6.9% |
| 1 | 1868 | 6.7% |
| 5 | 1442 | 5.2% |
| 8 | 1331 | 4.8% |
| 9 | 1171 | 4.2% |
| 6 | 1165 | 4.2% |
| 7 | 1154 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41952 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 12228 | |
| 2 | 3012 | 7.2% |
| 3 | 2673 | 6.4% |
| 4 | 1934 | 4.6% |
| 1 | 1868 | 4.5% |
| r | 1644 | 3.9% |
| p | 1644 | 3.9% |
| e | 1644 | 3.9% |
| s | 1644 | 3.9% |
| 5 | 1442 | 3.4% |
| Other values (13) | 12219 |
| Distinct | 2038 |
|---|---|
| Distinct (%) | 27.2% |
| Missing | 32 |
| Missing (%) | 0.4% |
| Memory size | 59.0 KiB |
| PrivacySuppressed | |
|---|---|
| 9500 | |
| 27000 | 306 |
| 25827.5 | 136 |
| 25000 | 124 |
| Other values (2033) |
Length
| Max length | 17 |
|---|---|
| Median length | 5 |
| Mean length | 7.392376383 |
| Min length | 4 |
Characters and Unicode
| Total characters | 55465 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1542 ? |
|---|---|
| Unique (%) | 20.6% |
Sample
| 1st row | 33888 |
|---|---|
| 2nd row | 21941.5 |
| 3rd row | 23370 |
| 4th row | 24097 |
| 5th row | 33118.5 |
| Value | Count | Frequency (%) |
| PrivacySuppressed | 1510 | 20.0% |
| 9500 | 514 | 6.8% |
| 27000 | 306 | 4.1% |
| 25827.5 | 136 | 1.8% |
| 25000 | 124 | 1.6% |
| 12000 | 118 | 1.6% |
| 20000 | 90 | 1.2% |
| 9833 | 89 | 1.2% |
| 14144 | 88 | 1.2% |
| 36173.5 | 81 | 1.1% |
| Other values (2028) | 4447 |
| Value | Count | Frequency (%) |
| privacysuppressed | 1510 | 20.1% |
| 9500 | 514 | 6.9% |
| 27000 | 306 | 4.1% |
| 25827.5 | 136 | 1.8% |
| 25000 | 124 | 1.7% |
| 12000 | 118 | 1.6% |
| 20000 | 90 | 1.2% |
| 9833 | 89 | 1.2% |
| 14144 | 88 | 1.2% |
| 36173.5 | 81 | 1.1% |
| Other values (2028) | 4447 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7539 | 13.6% |
| 5 | 3827 | 6.9% |
| 2 | 3615 | 6.5% |
| 1 | 3368 | 6.1% |
| r | 3020 | 5.4% |
| p | 3020 | 5.4% |
| e | 3020 | 5.4% |
| s | 3020 | 5.4% |
| 3 | 2113 | 3.8% |
| 7 | 2011 | 3.6% |
| Other values (14) | 20912 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 28946 | |
| Lowercase Letter | 22650 | |
| Uppercase Letter | 3020 | 5.4% |
| Other Punctuation | 849 | 1.5% |
Most frequent character per category
| Value | Count | Frequency (%) |
| r | 3020 | |
| p | 3020 | |
| e | 3020 | |
| s | 3020 | |
| i | 1510 | |
| v | 1510 | |
| a | 1510 | |
| c | 1510 | |
| y | 1510 | |
| u | 1510 |
| Value | Count | Frequency (%) |
| 0 | 7539 | |
| 5 | 3827 | |
| 2 | 3615 | |
| 1 | 3368 | |
| 3 | 2113 | 7.3% |
| 7 | 2011 | 6.9% |
| 9 | 1922 | 6.6% |
| 6 | 1636 | 5.7% |
| 8 | 1474 | 5.1% |
| 4 | 1441 | 5.0% |
| Value | Count | Frequency (%) |
| P | 1510 | |
| S | 1510 |
| Value | Count | Frequency (%) |
| . | 849 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 29795 | |
| Latin | 25670 |
Most frequent character per script
| Value | Count | Frequency (%) |
| r | 3020 | |
| p | 3020 | |
| e | 3020 | |
| s | 3020 | |
| P | 1510 | 5.9% |
| i | 1510 | 5.9% |
| v | 1510 | 5.9% |
| a | 1510 | 5.9% |
| c | 1510 | 5.9% |
| y | 1510 | 5.9% |
| Other values (3) | 4530 |
| Value | Count | Frequency (%) |
| 0 | 7539 | |
| 5 | 3827 | |
| 2 | 3615 | |
| 1 | 3368 | |
| 3 | 2113 | 7.1% |
| 7 | 2011 | 6.7% |
| 9 | 1922 | 6.5% |
| 6 | 1636 | 5.5% |
| 8 | 1474 | 4.9% |
| 4 | 1441 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55465 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 7539 | 13.6% |
| 5 | 3827 | 6.9% |
| 2 | 3615 | 6.5% |
| 1 | 3368 | 6.1% |
| r | 3020 | 5.4% |
| p | 3020 | 5.4% |
| e | 3020 | 5.4% |
| s | 3020 | 5.4% |
| 3 | 2113 | 3.8% |
| 7 | 2011 | 3.6% |
| Other values (14) | 20912 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| INSTNM | CITY | STABBR | HBCU | MENONLY | WOMENONLY | RELAFFIL | SATVRMID | SATMTMID | DISTANCEONLY | UGDS | UGDS_WHITE | UGDS_BLACK | UGDS_HISP | UGDS_ASIAN | UGDS_AIAN | UGDS_NHPI | UGDS_2MOR | UGDS_NRA | UGDS_UNKN | PPTUG_EF | CURROPER | PCTPELL | PCTFLOAN | UG25ABV | MD_EARN_WNE_P10 | GRAD_DEBT_MDN_SUPP | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Alabama A & M University | Normal | AL | 1.0 | 0.0 | 0.0 | 0 | 424.0 | 420.0 | 0.0 | 4206.0 | 0.0333 | 0.9353 | 0.0055 | 0.0019 | 0.0024 | 0.0019 | 0.0000 | 0.0059 | 0.0138 | 0.0656 | 1 | 0.7356 | 0.8284 | 0.1049 | 30300 | 33888 |
| 1 | University of Alabama at Birmingham | Birmingham | AL | 0.0 | 0.0 | 0.0 | 0 | 570.0 | 565.0 | 0.0 | 11383.0 | 0.5922 | 0.2600 | 0.0283 | 0.0518 | 0.0022 | 0.0007 | 0.0368 | 0.0179 | 0.0100 | 0.2607 | 1 | 0.3460 | 0.5214 | 0.2422 | 39700 | 21941.5 |
| 2 | Amridge University | Montgomery | AL | 0.0 | 0.0 | 0.0 | 1 | NaN | NaN | 1.0 | 291.0 | 0.2990 | 0.4192 | 0.0069 | 0.0034 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.2715 | 0.4536 | 1 | 0.6801 | 0.7795 | 0.8540 | 40100 | 23370 |
| 3 | University of Alabama in Huntsville | Huntsville | AL | 0.0 | 0.0 | 0.0 | 0 | 595.0 | 590.0 | 0.0 | 5451.0 | 0.6988 | 0.1255 | 0.0382 | 0.0376 | 0.0143 | 0.0002 | 0.0172 | 0.0332 | 0.0350 | 0.2146 | 1 | 0.3072 | 0.4596 | 0.2640 | 45500 | 24097 |
| 4 | Alabama State University | Montgomery | AL | 1.0 | 0.0 | 0.0 | 0 | 425.0 | 430.0 | 0.0 | 4811.0 | 0.0158 | 0.9208 | 0.0121 | 0.0019 | 0.0010 | 0.0006 | 0.0098 | 0.0243 | 0.0137 | 0.0892 | 1 | 0.7347 | 0.7554 | 0.1270 | 26600 | 33118.5 |
| 5 | The University of Alabama | Tuscaloosa | AL | 0.0 | 0.0 | 0.0 | 0 | 555.0 | 565.0 | 0.0 | 29851.0 | 0.7825 | 0.1119 | 0.0348 | 0.0106 | 0.0038 | 0.0009 | 0.0261 | 0.0268 | 0.0026 | 0.0844 | 1 | 0.2040 | 0.4010 | 0.0853 | 41900 | 23750 |
| 6 | Central Alabama Community College | Alexander City | AL | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0.0 | 1592.0 | 0.7255 | 0.2613 | 0.0044 | 0.0025 | 0.0044 | 0.0000 | 0.0000 | 0.0000 | 0.0019 | 0.3882 | 1 | 0.5892 | 0.3977 | 0.3153 | 27500 | 16127 |
| 7 | Athens State University | Athens | AL | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0.0 | 2991.0 | 0.7823 | 0.1200 | 0.0191 | 0.0053 | 0.0157 | 0.0010 | 0.0174 | 0.0057 | 0.0334 | 0.5517 | 1 | 0.4088 | 0.6296 | 0.6410 | 39000 | 18595 |
| 8 | Auburn University at Montgomery | Montgomery | AL | 0.0 | 0.0 | 0.0 | 0 | 486.0 | 509.0 | 0.0 | 4304.0 | 0.5328 | 0.3376 | 0.0074 | 0.0221 | 0.0044 | 0.0016 | 0.0297 | 0.0397 | 0.0246 | 0.2853 | 1 | 0.4192 | 0.5803 | 0.2930 | 35000 | 21335 |
| 9 | Auburn University | Auburn | AL | 0.0 | 0.0 | 0.0 | 0 | 575.0 | 588.0 | 0.0 | 20514.0 | 0.8507 | 0.0704 | 0.0248 | 0.0227 | 0.0074 | 0.0000 | 0.0000 | 0.0100 | 0.0140 | 0.0862 | 1 | 0.1610 | 0.3494 | 0.0415 | 45700 | 21831 |
Last rows
| INSTNM | CITY | STABBR | HBCU | MENONLY | WOMENONLY | RELAFFIL | SATVRMID | SATMTMID | DISTANCEONLY | UGDS | UGDS_WHITE | UGDS_BLACK | UGDS_HISP | UGDS_ASIAN | UGDS_AIAN | UGDS_NHPI | UGDS_2MOR | UGDS_NRA | UGDS_UNKN | PPTUG_EF | CURROPER | PCTPELL | PCTFLOAN | UG25ABV | MD_EARN_WNE_P10 | GRAD_DEBT_MDN_SUPP | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7525 | Strayer University-North Dallas | Dallas | TX | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | 36173.5 |
| 7526 | Strayer University-San Antonio | San Antonio | TX | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | 36173.5 |
| 7527 | Strayer University-Stafford | Stafford | TX | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | 36173.5 |
| 7528 | WestMed College - Merced | Merced | CA | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | 15623.5 |
| 7529 | Vantage College | El Paso | TX | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | 9500 |
| 7530 | SAE Institute of Technology San Francisco | Emeryville | CA | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | 9500 |
| 7531 | Rasmussen College - Overland Park | Overland Park | KS | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | 21163 |
| 7532 | National Personal Training Institute of Cleveland | Highland Heights | OH | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | 6333 |
| 7533 | Bay Area Medical Academy - San Jose Satellite Location | San Jose | CA | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | PrivacySuppressed |
| 7534 | Excel Learning Center-San Antonio South | San Antonio | TX | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1 | NaN | NaN | NaN | NaN | 12125 |